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© Error correcting code for B-bit-per-chip memory with reduced redundancy. 

© A reduced redundancy error correction and detection code is shown for memory organized with several bits 
of the data word on each chip. This package error correction and detection will correct all errors on any one chip 
and detect errors on more than one chip. A certain arrangement of an ECC matrix is first created for a symbol 
size code greater than the number of bits per chip. Thereafter certain columns of the matrix are removed to 
create the final code having a symbol size the same as the number of bits per chip. A specific example of an 80 
bit code word is shown having 66 data bits and 14 check bits for a 4-bit-per-chip memory. 
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ERROR CORRECTING CODE FOR B-BIT-PER-CHIP MEMORY WITH REDUCED REDUNDANCY 



Background Of The Invention 



5 This invention relates to error detection and correction and in particular to symbol error correction. A 
symbol is a subset of adjacent bits in a data word and the term package error detection and correction is 
sometimes used. In particular, this invention relates to reduced redundancy symbol error detection ana 
correction code which has particular application to a memory organized on a b-bit per chip basis where the 
symbol size is b. 

10 Applicants earlier U.S. Patent number 4.464,753 which is assigned to the same assignee as this 
application shows a general scheme for package error correction and an application to a symbol size of 2. 
U.S. Patent number 3,634,821 also shows a b-adjacent code. 

Applicants earlier U.S. Patent number 4,509.172 assigned to the same assignee as this application 
shows a code for package error detection with reduced redundancy. However, this patent does not show 

75 package error correction and is different from the present invention. 

A package error correction system for a 4 bit per package memory is shown in U.S. Patent number 
4,617,664 assigned to the same assignee as this invention. This patent does not shown two symbol error 
detection, however. In addition, the present invention represents a reduced redundancy over the system 
shown in the patent. 

20 Other prior art U.S. patents known to Applicant are 3,755,779; 3,745.525; 3.623,155 and 3,629.824 none 
of which, however, anticipate the present invention. 

U.S. Patent number 4,661 .955 assigned to the same assignee as the present invention, shows a system 
for detecting and eliminating soft errors in a package ECC. 

The IBM Technical Disclosure Bulletin also contains articles dealing with package error detection or 
25 correction. In particular, "SEC-DED Codes With Package Error Detection Ability". November 1979. pages 
2356-2359; "Optimized Error Correction/ Detection For Chips Organized Other Than By-1", March 1982, 
pages 5275-5276 and "Dual-Mode Error Correction and Error Detection". June 1985. pages 55-58 illustrate 
the state of the art but do not anticipate the present invention. 

Because of the cost of a system, such as a memory, is dependent on the number of bits required for 
30 error detection and correction, any reduction in the resource required for the system will produce a cost 
savings. Therefore, it is desirable to have an improved error correction and detection system with reduced 
redundancy to save on costs in a system, such as a memory. 



35 Summary of The Invention 



This invention is a symbol or package error correcting and detecting code having a reduced 

redundancy. An embodiment of the code is shown in connection with four bit-per-chip memories where an 
40 SSC-DSD (single symbol correction - double symbol detection) code is shown for 66 data bits with 14 extra 

bits to form an 80 bit code word. This represents a savings of 2 redundancy bits as compared to the prior 

art. Thus, there may be two additional data bits in an 80 bit code word. 

The code according to the invention is formed by first preparing the general form H matrix for a b-bit- 

per package SSC-DSD code according to the prior art where the symbol size is b, the code length is bN 
45 where there are N packages and the number of check bits is r. Then this general form H matrix is 

converted to a standard form H(s) matrix where the first non-zero bxb submatrix of each of the N b-column 

groups of the matrix is the bxb identity matrix. 

The new code is constructed from H(s) by removing the same set of e columns from each b-column 

group of the H(s) matrix to form an intermediate matrix H(l) where e is the number of reduced bits from the 
so general H matrix SSC-DSD code. Finally, remove the e all zero rows from the matrix H(l) just formed to 

create a resultant matrix H(r) with symbol size b-e and code length (b-e)N and the number of check bits 

equal to r-e. 
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The Drawings 

FIGURES 1A and 1B, viewed in left to right relationship, represent an error correction and detection 
5 code matrix for producing check bits for a particular example of a code according to the invention. 

FIGURES 2A and 2B, viewed in left to right order, represent a resultant matrix for error correction and 
detection according to the present invention. 

FIGURE 3 shows the mapping of bit positions to memory chips for a 4-bit per chip" memory using the 
code shown in FIGURES 2A and 2B for a particular embodiment according to the present invention. 
10 FIGURE 4 is a block diagram of part of the circuit for the i symbol where i = 1 or 4 - 20 for 

generating error signals and identifying uncorrectable errors (UEs) for a code in a system according to an 
embodiment of the present invention as shown in FIGURES 2A and 2B and organized as shown in FIGURE 
3. 

FIGURE 5 is a block diagram of the remainder of the circuit for the i symbol where i = 2 and 3 
/5 shown in FIGURE 4 according to the present invention. 

FIGURE 6A is' the H(i) matrix, where i = 20, for multiplication with the S1 signal as shown in FIGURE 
4 according to the present invention. 

FIGURE 6B is the logic required to perform the multiplication of the matrix shown in FIGURE 6A by 
the S1 signal as shown in FIGURE 4 according to the present invention. 

20 

Description of the Preferred Embodiment 



25 A computer memory designed with an error correcting code (ECC) is conventionally organized in a one- 
bit-per-chip fashion with respect to the ECC words. This organization guarantees that at most one bit of an 
ECC word would be corrupted by a chip failure. The class of SEC-DED (single error correcting and double 
error detecting) codes are normally used to control the errors in the one-bit-per-chip memory systems. 
The trend in memory chip design has been toward denser and bigger chips. The designs of memory 

30 chips that can store 4 megabits and 16 megabits have already been reported. If 4 megabit chips are used 
to design a one-bit-per-chip memory system with an 8 data byte bandwidth, the minimum system capacity 
would be 32 megabytes. However, many systems do not require 32 megabytes of storage. As a result, the 
system designer would be forced to store multiple bits of the same ECC word in the same memory chip. 
Another advantage of a multiple-bit-per-chip memory organization is that a fewer number of chips has to be 

35 powered on at a time. In the future, it will be common for memory systems to be organized in a b-bit-per- 
chip fashion, where b is greater than one. 

In a b-bit-per-chip memory organization, a chip failure would generate a symbol error, an error pattern 
consisting of one to b errors, in the data word. The conventional SEC-DED codes would not be able to 
effectively control symbol errors. Also, if b is greater than 2, a symbol error may be miscorrected by an 

40 SEC-DED code. Thus, there is a potential of losing data integrity. 

The appropriate ECC for a b-bit-per-chip memory design is the class of SSC-DSD codes that can 
correct all single symbols errors and detect all double symbol errors in an ECC word, where a symbol error 
is a b-bit error pattern generated from a chip failure. An SSC-DSD code is capable of correcting all the 
errors generated from any single chip failures and detecting all the errors generated from two chip failures. 

45 A technique is described for the construction of SSC-DSD codes that are more efficient than previous 
known codes. That is, a new code requires a fewer number of check bits for the same amount of data bits, 
or a new code can protect more data bits with the same number of check bits as compared to a known 
SSC-DSD code. For an example, a known SSC-DSD code for 64 data bits and b = 4 requires 16 check bits. 
A comparable new code according to this invention requires only 14 check bits. As another example, for 

so b=4 and 20 check bits, the most efficient known SSC-DSD code can protect 2036 data bits, while a new 
code according to this invention with 19 check bits can protect 4081 data bits. 

A code word of an SSC-DSD code consists of N b-bit symbols, or bN bits. Let C = (C1, C2 CN) be 

a code word, where each component Ci, 1£ i£ N, of C is a symbol of b-bit pattern. The code word must 
satisfy a set of linear equations that can be expressed as: 

55 H • C 1 = 0 mod 2, (1) 

where C* is the transpose of C, and H, the parity check matrix of the code, is a binary matrix with bN 
columns. The columns of H can be divided into N b-column groups. If the rank of H is equal to r, then the 
code has r check bits and bN-r data bits. The code is conventionally denoted as a (bN, bN-r) code, where 

3 
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bN is the code length and bN-r is the number of data bits. 

The symbol weight of a vector is the number of nonzero symbols in the vector. The symbol aistance of 
two vectors is the number of symbol positions in which the two vectors are different. The minimum symbol 
distance of a code is the minimum of the symboi distance between any two code words of the code. A 
code is an SSC-DSD code if the minimum symbol distance of the code is equal to or greater than four. 

The parity check matrix of a known cede constructed according to the prior art has the following 
general form: 



jo 



75 



H = 



A i: L i2 

Mo 1 0 *5 



T R1 T R2 



T 1S 



. . T 



RN 



(2) 



where each of the b x b submatrix Tjj is either an all zeros matrix or a power of the companion matrix of a 
primitive polynomial of degree b. For an example, the companion matrix of the primitive polynomial 1 + x 
+ x* is: 



0 0 0 1 
10 0 1 
0 10 0 
, 0 0 1 0 

All 15 distinct powers of this companion matrix are shown in Table 1. 

Note that T° is the identity matrix and V is used to denote the all zeros matrix. Note, also that the 
inverse of T is T" 1 =T 14 , and T'^T 15 " 1 Table 2(A) shows the general parity check matrix, in terms of T of 
35 Table 1, for a (44.32) SSC-DSD code with symbol size b = 4. Table 2(B) shows the parity check matrix in 
standard form to be explained later. 

A word W read from the memory at a particular location may not be the same as the code word C 
originally written into the same location, because errors may be generated from physical failures. The 
difference between W and C is defined as the error pattern E. Let W = (Wi, W 2 W N ) and E = (Ei , E2, 

E N ) Then W = C + E, and ^ = W } - Cj for 1S i S N. From (1), we have 
S = H • W l 

= H •(C t + E x ) (3) 

= H • E l , 

The vector S is called the error syndrome. The vector S is calculated from the parity check matrix H 
a5 and the word W read from the memory, and it is independent of the original code word C that was stored in 
the memory. If there is no error in W, i.e. E = 0, then S is an all zero vector. On the other hand, if S is not 
an all zero vector, then errors in W are detected. In this case, the error correction process is to determine 
first the error pattern E t then to recover the original code word C by the bit-by-bit exclusive or (XOR) 
operation of W and E, which is equivalent to saying C = W - E. 
CQ The parity check matrix of an SSC-DSD code with symbol size b and code length bN is said to be in a 
standard form if the first non-zero bxb submatrix of each of the N b-column groups of the matrix is the b x b 
identity matrix. If the parity check matrix H is first generated in the general form of equation (2) above, the 
matrix can be transformed into a standard form using the following algorithm. 
1 . Set j = 0. 
55 2. Let j=j + 1. If j > N. exit. 

3. Let k be the smallest integer i, 12 i 2 R, such that is a non-zero matrix. Replace T^ by T fj • - 
(Tkj)" ? mod 2 for k£ i £ R, where (T)" 1 denotes the inverse of matrix T. 



25 
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4. Go back to step 2. 

Applying the algorithm, the parity matrix of Table 2(A) can be transformed into a standard form as 
shown in Table 2(B). If the original parity check matrix defines an SSC-DSD code, the transformed matrix in 
standard form also defines an SSC-DSD code with the same number of check bits. 

A new code is now constructed from a known code. Let H be the parity check matrix of a known (b N, 
b'N-r) SSC-DSD code with a symbol size of b\ The number of check bits for the code is equal to r. The 
procedure to construct a new SSC-DSD code with a symbol (b'-e) = b is as follows: 

1. Transform matrix H into a standard form. 

2. Delete consistently the same set of e columns from each b -column group of the matrix in 
standard form H(s). Then delete the e all zeros rows from the resultant matrix. Let H(r) be the final resultant 
matrix. Then H(r) is the parity check matrix of an SSC-DSD code with symbol size (b -e) = b and code 
length (b'-e)N. The number of check bits is equal to r-e. In general, e can be 1 or greater than 1. 

If the fourth column of each of the 11 column groups of the matrix in Table 2(B) is deleted, and the 
fourth row of the matrix is also deleted, the matrix becomes the form shown in Table 3. The new matrix 
defines a (33,22) SSC-DSD code with symbol size b = 3. 

Table 4 shows the parameters of some new and more efficient SSC-DSD codes that can be constructed 
from the technique described in this section. 

A memory system is to be organized in 4 bits per chip, and an SSC-DSD code is to be used to protect 
66 data bits. A previously known code would require 16 check bits. From Table 4, a new code of length 136 
with 14 check bits can be constructed. This code can be shortened to protect 66 data bits, resulting in a 
(80,66) code that saves 2 check bits as compared to the best previously known code. Also for an 80 bit 
code word, two additional data bits may be provided. The construction of a (80,66) SSC-DSD code with 
b = 4 is described below. 

The primitive polynomial 1 + x 2 + x 5 , is used along with its companion matrix. 

0 0 0 C 1 

1 0 0 0 0 
T - 0 1 0 0 1 (4) 

0 0 10 0 
0 0 0 1 0 



Let b' = 5, R = 3, and N = 34. The parity check matrix of the form (2) can be designed to define an SSC- 
DSD code with a symbol size of 5. The entries of the parity check matrix are powers of T or the 5 x 5 all 
zeros matrix. Only 20 of the 34 column groups are selected to form the following matrix. 
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18 
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30 


29 
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28 
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27 
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17 


7 


16 


8 



45 Let H be the 15 x 100 binary matrix obtained from the mapping of the above matrix according to the 
following rules: map " into a 5 x 5 all zeros matrix, and map i into T\ for 05 i S 30. where T is the matrix of 
(4). Matrix H defines a (100,85) SSC-DSD code with b' =5. Now. delete the fourth row of H and delete also 
every other fifth column of H. The resultant matrix H2 is shown in FIGURE 2. This 14 x 80 matrix is the 
parity check matrix of a (80.66) SSC-DSD code with b = 4. 

so In memory system design, it may be desirable to check byte parity in a speedy way. In this case, it is 
also desirable to be able to include some byte parities as interim results in the generation of ECC check 
bits. Byte parities do not need to be generated to transfer data. For this consideration, we permute the 
columns of the matrix in FIGURE 2 to obtain the matrix H1 in FIGURE 1. Matrix H1 will be used to generate 
the ECC check bits. 

55 Refer to matrix H1, any set of 14 bit-positions can be designated as the locations of check bits as long 
as the corresponding 14 columns of H1 form a non-singular matrix, i.e., those 14 columns are linearly 
independent. Designate positions 1-12, 21 and 30 as ECC check bit positions. Note that each of the first 
four ECC check bits can be obtained from the parities of two data bytes (the vertical lines in H1 define the 
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data byte boundaries). For example, ECC bit 4 is the sum (XOR) of byte parity of bits 40-48 and the byte 
parity of bits 73-80. 

Matrix H1 provides rules for the generation of the fourteen ECC check bits. From (1), the XOR sum of a 
code word at the positions indicated by the ones in each row of H1 has to be equal to 0. Thus, the rows of 
5 H1 define 14 equations that a code word has to satisfy. The ECC check bits are derived from these 
equations. For example, check bit 5 is the XOR sum of data bits at positions 15. 17. 19. 20. 23. 26-28, 35, 
38-40, 43, 46. 48. 51. 52, 55, 56, 58. 61, 66. 68. 72, 78 and 79. 

The syndrome of a code word read from the memory is the XOR of the ECC bits read directly from the 
memory and the ECC bits generated from the data read from the memory. It can be verified that the 
io syndromes of errors in ail single chips are nonzero and distinct and the syndromes of errors in ail double 
chips are not the same as any one of the correctable error syndromes. 

If the fourteen syndrome bits are all zeros, no error is assumed. If the syndrome is nonzero, there are 
errors in the code word. In this case, the chip positions and bit positions of the errors have to be iaentified. 
All double chip errors and some multiple chip errors will be detected as UE (uncorrectable errors), in the 
15 following description, the first four bits of the syndrome are called S1 and the last ten bits of the syndrome 
are called S2. 

The matrix of FIGURE 2 is used for processing the syndrome in error correction. The matrix is divided 
into 20 submatrices, one for each chip, of four columns. Each submatrix consists of a 4x4 identity matrix 
and a 10x4 matrix H(i) for i = 1,2 20. To check if a chip i is in error, H(i) is multiplied by S1, and the 

20 resultant ten bit pattern is compared with S2. If there is a total agreement, chip i is in error and S1 is the 
four bit error pattern within chip i. If there is a disagreement, then chip i is error-free. FIGURE 3 shows the 
assignment of code word bit positions to chips for this embodiment. The block diagram of FIGURE 4 shows 
the logic for the identification of errors in chip i for i = 1 and 4-20. FIGURE 5 shows the logic for chips 2 and 
3. If the syndrome is nonzero and there is no error indicator in each of the twenty chips, then the UE signal 

25 should be on for an uncorrectable error. 

The multiplication H(i) by S1 can be implemented with an XOR tree. FIGURE 6B is an illustration of the 
multiplication for i = 20. H(i=20) is shown in FIGURE 6A. The 1-10 individual output bits are shown at the 
bottom of the XOR blocks while the 1-4 input bits are shown at the top. 
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Powers Of The Companion Matrix Of 1 + x 



T* = 0 = 



0 0 0 0 

0 0 0 0 

0 0 0 0 

0 0 0 0 

0 0 0 1 

10 0 1 

0 10 0 

0 0 10 



10 0 0 
0 10 0 
0 0 10 
0 0 0 1 

0 0 10 
0 0 11 
10 0 1 
0 10 0 



20 



0 10 0 
0 110 
0 0 11 
10 0 1 



10 0 1 
110 1 
0 110 
0 0 11 



25 



30 



35 



T' = 



0 0 11 
10 10 
110 1 
0 110 

110 1 
10 11 
0 10 1 
10 10 



0 110 
0 10 1 
10 10 
110 1 

10 10 
0 111 
10 11 
0 10 1 



40 



1 = 



0 10 1 

1111 

0 111 
10 11 



,10 



10 11 
1110 

1111 

0 111 



45 



50 



,11 



,13 _ 
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0 111 
110 0 
1110 

1111 

1110 
0 0 0 1 
10 0 0 
110 0 



,12 



,14 



1111 

10 0 0 
1.10 0 
1110 

110 0 
0 0 10 
0 0 0 1 
10 0 0 
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TABLE 2 

Parity Check Matri:: Of A , 22) SSC-DSD C~r:Q With b=± 
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(A) General Form 
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TABLE 4 



Parameters of Some New SSC-DSD Codes 


No. of 


Symbol 


Code Length of Best 


Code Length of 


Check Bits 


Size b 
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New Codes 


14 
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34x4 


15 
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1025x3 
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1025x4 
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Claims 

1. A method of producing reduced redundancy ECC code for a b-bit-per-package system comprising: 

a) determine the general form H parity check matrix of the corresponding sinple symbol correction- 
20 double symbol detection SSC-DSD code with symbol size b' and code length b N where there are N 

packages and the number of check bits is r; ' 

b) convert the general form H matrix of step a) into the standard form matrix H(S) where the first 
nonzero b xb submatrix of each of the N b'-column groups of the matrix is the b'xb' identity matrix; 

c construct the new code from H(s) by: 
25 i) first delete consistently the same set of e columns from each b'-column group of the H(s) matrix 

where e is the number of reduced bits from the general H matrix SSC-DSD code, 

ii) then delete the e all zero'es rows from the matrix formed in step c)i) to create a resultant matrix H(r) 
with symbol size (b'-e) = b and code length (b'-e)N and the number of check bits equal to r-e. 

2. The method of Claim 1 wherein b = 4, N = 20, code length equals 80 and number of check bits equals 
30 14 and the matrix H1 for generation of the code is: 
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3. The method of Claim 1 wherein e is equal to one. 

4. A method of creating a package error correction and detection code for a 4-bit per chip memory 
having N chips comprising: 

first, creating a known error correcting code matrix having a 5-bit symbol length for N packages and a 
45 code word length of 5N, 

second, rearranging the code matrix created in the first step so that the first nonzero 5x5 submatrix of 
each of the N 5-column groups of the matrix is the 5x5 identity matrix, 

third, deleting the same 1 -column groups from each 5-column group of the matrix formed in the second 
step, and 

so finally, deleting one all zero row from the matrix formed in the third step to create a resultant matrix 

with symbol size 4 and code length 4N. 
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